
GAATTCCGGGTGGGTAGGTCGGGCAGGGTAGGACAGGCCTAAGAGAGAGGCCGGACAGACCTCC 
TTTGGAAGCAGCCACTTCTGGTCCCCATCCCTGGAGCGATCGAGCGCAGGATCTGCTGTCCCAT 
GGGACAGCAGATCTCTCTTCCCAGTGCACAGCTTCTCCTCTGCCTGTTTTCCCTGCTTCCAGTG 
CTCCAGGTGGCCCAACCAGGCCAGGCACCCCAGGACCAGCCCCTGTGGACACTTTTGGAGCAGT 
ACTGCCACAGGACCACAATTGGGAATTTTTCAGGTCCCTACACCTACTGCAACACGACCTTGGA 
CCAGATCGGGACCTGCTGGCCACAGAGCGCACCCGGAGCCCTAGTAGAGAGACCGTGCCCCGAG 
TACTTCAATGGCATCAAGTACAACACGACCCGGAATGCCTACAGAGAGTGCCTGGAGAACGGGA 
CCTGGGCCTCAAGGGTCAACTACTCACACTGCGAACCCATTTTGGATGACAAGCAGAGAAAGTA 
TGACCTGCATTACCGAATCGCCCTCATTGTCAACTACCTGGGTCACTGTGTTTCCGTGGTGGCC 
CTGGTGGCCGCTTTCCTGCTTTTCCTAGTGCTGCGGAGTATCCGCTGCCTGAGGAATGTGATCC 
ACTGGAACCTCATCACCACCTTCATTCTGAGAAACATCGCGTGGTTCCTGCTGCAACTCATCGA 
CCACGAAGTGCACGAGGGCAATGAGGTCTGGTGCCGCTGCATCACCACCATCTTCAACTATTTT 
GTGGTCACCAACTTCTTCTGGATGTTTGTGGAGGGCTGCTACCTGCACACGGCCATTGTCATGA 
CGTACTCCACAGAGCACCTGCGCAAGTGGCTTTTCCTCTTCATTGGATGGTGCATTCCCTGCCC 
TATCATCATCGCCTGGGCAGTTGGCAAACTCTACTATGAGAATGAGCAGTGCTGGTTTGGCAAG 
^ GAAGCTGGTGATTTGGTGGACTACATCTACCAGGGCCCCGTCATGCTTGTGCTGTTGATCAATT 
J TTGTATTTCTGTTTAACATCGTCAGGATCCTGATGACGAAGTTACGAGCATCCACCACGTCCGA 
3 GACAATCCAATACAGGAAGGCAGTGAAGGCCACGCTGGTCCTCCTCCCCCTGTTGGGCATCACC 
Q TACATGCTCTTCTTTGTCAATCCTGGCGAGGACGACCTGTCCCAGATTGTGTTCATCTACTTCA 

a actctttcctgcagtccttccagggtttctttgtgtccgttttctactgcttcttcaatg6aga 
-4 ggtgcgcgcggccctgagaaagcggtggcactcggggcaggaccaccacgccctccgggtgqct 
Q gtgcgccgggccatgtccatccctacgtcgcccaccaggatcagcttccacagcatcaagcaga 
<=» cagctgctgtgtgaccctctgtcaccgtctgcccggcagtccaccactgaggcagcttctccat 
cctttacagccttcccctgggtcctccttgctaccctgacccacaggtacaaggtacaggagaa 
y gggaggagaacgaacactccc (seq id n0:1) 

o 



FIGURE 1 



underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



GAATTCCGGGTGGGTAGGTCGGGCAGGGTAGGACAGGCCTAAGAGAGAGGCCGGACAGAC 
C TCC TTTGGAAGC AGC C AC TTCTGGTCCC C ATC CCTGGAGCGATCGAGCGC AGGATCTGC 
TGTCCCATGGGACAGCAGATCTCTCTTCCCAGTGCACAGCTTCTCCTCTGCCTGTTTTCC 
CTGCTTCCAGTGCTCCAGGTGGCCCAACCAGGCCAGGCACCCCAGGACCAGCCCCTGTGG 
AC ACTTTTGGAGC AGTAC TGCC AC AGGAC C AC AATTGGGAATTTTTC AGGTC CCTAC ACC 
TACTGCAACACGACCTTGGACCAGATCGGGACCTGCTGGCCACAGAGCGCACCCGGAGCC 
CTAGTAGAGAGACCGTGCCCCGAGTACTTCAATGGC ATCAAGTAC AAC ACGACCC E GGAA 
TGCCTACAGAGAGTGCCTGGA] GAACGGGACCTGGGCCTCAAGGGTCAACTACTCACACT 
GCGAACCCATTTTGGATGACAAGCAGAGAAAGTATGACCTGCATTACCGAATCGCCCTCA 
TTGTCAACTACCTGGGTCACTGTGTTTCCGTGGTGGCCCTGGTG [GCCGCTTTCCTGCTT 
TTCCTAGTGCTGCG ] GAGTATCCGCTGCCTGAGGAATGTGATCCACTGGAACCTC ATCAC 
CACCTTCATTCTGAGAAACATCGCGTGGTTCCTGCTGCAACTCATCGACCACGAAGTGCA 
CGAGGGC AATGAGGTC TGGTGC CGC TGC ATC AC C AC C ATC TTC AAC T ATTTTGTGGTC AC 
CAACTTCTTCTGGATGTTTGTGGAGGGCTGCTACCTGCACACGGCCATTGTCATGACGTA 
CTCCACAGAGCACCTGCGCAAGTGGCTTTTCCTCTTCATTGGATGGTGCATTCCCTGCCC 
TATCATCATCGCCTGGGCAGTTGGCAAACTCTACTATGAGAATGAGCAGTGCTGGTTTGG 
CAAGGAAGCTGGTGATTTGGTGGACTACATCTACCAGGGCCCCGTCATGCTTGTGCTGTT 
GATCAATTTTGTATTTCTGTTTAACATCGTCAGGATCCTGATGACGAAGTTACGAGCATC 
CACCACGTCCGAGACAATCCAATACAGGAAGGCAGTGAAGGCCACGCTGGTCCTCCTCCC 
CCTGTTGGGCATCACCTACATGCTCTTCTTTGTCAATCCTGGCGAGGACGACCTGTCCCA 
GATTGTGTTCATCTACTTCAACTCTTTCCTGCAGTCCTTCCAGGGTTTCTTTGTGTCCGT 
TTTC T ACTGC TTCTTC AATGG AG AGGTGC GC GC GGC CC TG AG AAAGC GGTGGC AC TC GGG 
GCAGGACCACCACGCCCTCCGGGTGCCTGTGCGCCGGGCCATGTCCATCCCTACGTCGCC 
CACCAGGATCAGCTTCCACAGCATCAAGCAGACAGCTGCTGTGTGACCCTCTGTCACCGT 
CTGCCCGGCAGTCCACCACTGAGGCAGCTTCTCCATCCTTTACAGCCTTCCCCTGGGTCC 
TCCTTGCTACCCTGACCCACAGGTACAAGGTACAGGAGAAGGGAGGAGAACGAACACTCC 
C 



FIGURE 2A 



Gene Sequence Structure 



441 bp 



Sequence Deleted 



582 bp 



Size of full-length 
cDNA: 1557 bp 




Targeting Vector* (genomic sequence) 
Construct Number: 3050 



Arm Length: 
^: 3.1 kb 
E f: 2.5 kb 



5* arm 



LacZ-Neo 
Cassette 



3' arm 




P=! Targeting Vector 

- — — — - Endogenous Locus 

^* Not drawn to scale 



5 1 >AGCCCTATGTGTAATTTTCAT 
ATAAATGAC TC ATATT AGC TTTC A 
GATATGCATTGTGTTTTCAGGTCT 
GGGAGAACTAAGGAGTGTGGACCT 
T ATC C TGC AGGT AC T AGGG AGCC A 
GGGAGGGCTTTTGAGGCGGGAGGG 
CGTCCTGACTCTCAGTGGTTGGCA 
TC TTC TC TAGGG AATGCC T AC AG A 
GAGTGCCTGGA<3 1 
(SEQ ID NO: 2) 



5 1 >GCCGCTTTCCTGCTTTTCCTA 
GTGCTGCGGTGAGTCCACCTCCAC 
CCTGCTTCCTCCTTGTCTTTGCCT 
CTCCCAGACATTGTCTCTTCCATT 
CTGGGGCCCCGGGAACAGTAGCCA 
G AAGTGGGTTT AAGTC AGACC C C C 
AGGGCCATGACCACCAGCCTGCCT 
GAAGGGTAGAGAGC AAGCC C AGC T 
GGGACCACCAG< 3 1 
(SEQ ID NO: 3) 
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